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ABSTRACT 

Objective: We performed a systematic review, meta- 
analysis and assessed the evidence supporting a 
causal link between knee joint loading during walking 
and structural knee osteoarthritis (OA) progression. 
Design: Systematic review, meta-analysis and 
application of Bradford Hill's considerations on 
causation. 

Data sources: We searched MEDLINE, Scopus, 
AMED, CINAHL and SportsDiscus for prospective 
cohort studies and randomised controlled trials (RCTs) 
from 1950 through October 2013. 
Study eligibility criteria: We selected cohort studies 
and RCTs in which estimates of knee joint loading 
during walking were used to predict structural knee OA 
progression assessed by X-ray or MRI. 
Data analyses: Meta-analysis was performed to 
estimate the combined OR for structural disease 
progression with higher baseline loading. The 
likelihood of a causal link between knee joint loading 
and OA progression was assessed from cohort studies 
using the Bradford Hill guidelines to derive a 0-4 
causation score based on four criteria and examined 
for confirmation in RCTs. 

Results: Of the 1078 potentially eligible articles, 5 
prospective cohort studies were included. The studies 
included a total of 452 patients relating joint loading to 
disease progression over 12-72 months. There were 
very serious limitations associated with the 
methodological quality of the included studies. The 
combined OR for disease progression was 1.90 (95% 
CI 0.85 to 4.25; l 2 =77%) for each one-unit increment 
in baseline knee loading. The combined causation 
score was 0, indicating no causal association between 
knee loading and knee OA progression. No RCTs were 
found to confirm or refute the findings from the cohort 
studies. 

Conclusions: There is very limited and low-quality 
evidence to support for a causal link between knee 
joint loading during walking and structural progression 
of knee OA. 

Trial registration number: CRD4201 2003253 



Strengths and limitations of this study 



We performed the first systematic evaluation of 
the evidence for a causal link between mechan- 
ical loading of the knee during walking and 
structural progression of knee osteoarthritis. 
This systematic review, meta-analysis, and caus- 
ation analysis find very limited evidence of a 
causal link between knee joint loading during 
walking and structural progression of knee 
osteoarthritis. 

Few studies were included in the meta-analysis 
since only a small number of studies have been 
conducted. Further research is needed. 



BACKGROUND 

Osteoarthritis (OA) is the most common 
form of arthritis affecting a large section of 
the population, and is a major cause of 
illness and disability. 1 2 The knee joint is 
most frequently affected, 3 and due to the 
knee's crucial role in independent ambula- 
tion, knee OA leads to considerable disability 
affecting an individual's participation in 
society and independent living. Because no 
cure is available and many people are 
affected, healthcare costs associated with 
knee OA are enormous — even higher than 
for the more high-profile diseases such as 
diabetes, cancer and cardiovascular diseases. 4 
With its prevalence on the rise, knee OA 
poses a substantial socioeconomic and public 
health burden. 

Walking is the most common form of 
human locomotion, and most people walk 
every day, resulting in millions of steps per 
year. Biomechanical loading on the knee 
joint during walking is estimated to exceed 
2-3 times body weight 5 ; thus, knee joint 
loading is unavoidable component of an 
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independent lifestyle. One widely accepted theory is that 
higher knee joint loading during walking is causally 
linked with accelerated structural knee OA progres- 
sion. 6-8 In accordance with this theory unloading is 
advocated in an attempt to slow or halt disease progres- 
sion. 9 Unspecific measures of high joint loading, such as 
excess body mass and body mass index (BMI), have 
been associated with OA development, 10 11 and obesity 
is believed to act mainly through high mechanical 
loads, 12 although systemic effects have also been indi- 
cated. 13 However, higher body mass does not necessarily 
lead to higher joint loading during walking, 14 and more 
specific measures of joint loading are necessary to apply. 

Objective estimates of joint loading during walking 
can be obtained by three-dimensional gait analysis. 
Typically, the knee adduction moment (KAM) or bio- 
mechanical modelling of compression forces is used. 
The KAM has been of particular interest because it 
reflects the medial to lateral joint load distribution, 15 
with good face validity with respect to the relative preva- 
lence of medial, as opposed to lateral compartment, 
tibiofemoral knee OA. 16 Despite its common usage, a 
recent systematic review did not find consistent evidence 
to the fact that the KAM differs between those with and 
without knee OA. 17 

The relationship between knee joint loading during 
walking and knee OA has been a focus of knee 
OA-related biomechanical research for 10-20 years. 
Studies reveal that symptomatic treatment of knee OA 
results in increases in knee joint loads during 
walking, 5 18-20 which are unwanted according to the 
hypothesised structural consequences of increased joint 
loading. This may generate confusion among research- 
ers, clinicians and patients, who are interested in this 
information to aid them in prevention, palliation and 
treatment strategies. Establishing causal relationship 
among modifiable factors, such as knee joint loading, 
and disease-specific measures, such as structural disease 
progression, is imperative in the generation of effective 
strategies. 

Seminal epidemiological work by Hill proposed a set 
of considerations (the Bradford Hill criteria) to systemat- 
ically evaluate the existence of a causal link between an 
exposure and a health outcome. 21 These criteria have 
been previously employed to demonstrate the causal 
link between smoking and lung cancer, 22 and between 
dietary factors and coronary heart disease, 23 and have 
the capacity to evaluate the link between knee joint 
loading and structural knee OA progression. 

Therefore, the objectives of this study were (1) to sys- 
tematically evaluate the evidence supporting a causal 
link between exposure to knee joint loading during 
walking and structural progression of knee OA based on 
Hill's considerations on causality 21 and (2) to determine 
which knee joint load variables had been studied suffi- 
ciently in randomised controlled trials (RCTs) and 
found to support the findings of prospective cohort 
studies. 



METHODS 

The methods of the study search strategy, inclusion cri- 
teria and data analysis were prespecified in a protocol 
(see online supplementary file 1) and pre-registered 
(PROSPERO 2012:CRD42012003253). 

We searched MEDLINE, Scopus, AMED, CINAHL and 
SportsDiscus for prospective cohort studies and RCTs 
from 1950 through October 2013. Search strategies are 
available as an online supplementary file 2. The refer- 
ence lists of the retrieved articles were also searched for 
additional cohort studies and RCTs. Two reviewers (MH 
and MWC) independently assessed study eligibility. 
Excluded studies and reasons for exclusion were 
recorded. Disagreement was resolved by discussion and 
consensus. We included original full-length articles per- 
taining to the association between knee joint loading 
during walking, measured by three-dimensional gait ana- 
lysis, and structural disease progression assessed quanti- 
tatively or semiquantitatively by X-ray or MRI. We 
considered only those studies that included participants 
diagnosed with knee OA that were followed for at least 
1 year (necessarily because of the sensitivity of imaging 
and slow disease progression 24 ). Cohort studies had to 
include estimates of knee joint loading during walking 
at baseline and imaging-based assessment of structural 
disease progression. Clinical trials had to be randomised 
and compare image-based structural disease progression 
among different knee joint loading interventions 
(eg, increase vs decrease), placebo or control. 

Data extraction and analysis 

The following data were extracted from the studies: (1) 
Study design; (2) Country of origin; (3) Number of sub- 
jects; (4) Characteristics of the subjects; (5) Type of 
knee joint loading estimate; (6) Magnitudes of the esti- 
mate; (7) Description of the interventions (if any) and 
(8) Structural disease progression outcome (ie, semi- 
quantitative or quantitative measures on X-ray and/or 
MRI). 

For prospective cohort studies, we extracted estimates 
of the association between baseline knee joint loading 
and structural disease progression from baseline to 
follow-up. A progression group is typically compared to a 
non-progression group at baseline and the results are 
typically reported as ORs for disease progression for a 
one-unit higher baseline knee joint loading. If no OR 
was reported, we either calculated it from the reported 
data (if possible), or contacted the study authors and 
requested for the OR. For the RCTs we aimed to 
compare relative risks of structural disease progression 
between knee joint loading modification and control 
groups. 

The individual study results (ie, log OR values and 
their corresponding SEs) were combined in a random 
effects meta-analysis model using the generic inverse 
variance outcome type in Review Manager. 25 
Heterogeneity between trials was assessed using the 
standard Q-test statistic (testing the hypothesis of 
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homogeneity), and we present the I 2 value, which can 
be interpreted as the percentage of total variation across 
the studies due to heterogeneity. 26 

Methodological quality and risk of bias assessment 

The methodological quality of the cohort studies was 
assessed using a published check list. 27 Two of the 
authors (MH and MWC) assessed this individually and 
judged each criterion to be 'Adequately described', 
'Unclear', or 'Inadequately described'-corresponding to 
'low risk of bias', 'unclear risk of bias', and 'high risk of 
bias', respectively. Disagreement was resolved by discus- 
sion. If an included study was authored by one or more 
of the current authors, a third reviewer (HL) was asked 
to perform a quality assessment. One quality assessment 
item was omitted ("Was a dose-response relationship 
between exposure and outcome demonstrated?") 
because this item relates to the findings of the study and 
not the methodological quality, and assessment of dose- 
response forms part of the Bradford Hill criteria for 
causation (see below). The overall extent of risk of bias 
and methodological quality within each study was 
assessed using the GRADE approach to evaluate study 
limitations. 28 

Evaluation of evidence for causality 

Based on the Bradford Hill considerations on causality, 21 
a causation score was developed and used to systematic- 
ally evaluate the evidence of a causal link between knee 
joint loading during walking and structural progression 
of knee OA. A similar score has previously been derived 
from the Bradford Hill considerations to assess the 
causal relationship between dietary factors and coronary 
heart disease. 23 The following criteria were used in the 
review of the cohort studies and given a score of 1 (cri- 
terion satisfied) or 0 (criterion not satisfied): 

1. Strength of association: Associations quantified as a 
pooled OR >5.0, with lower 95% CI above 2.0; the 
expected direction was defined as 'strong associ- 
ation.' 'Moderate association' is defined as any statis- 
tically significant pooled OR (p<0.05). A statistically 
non-significant pooled OR was defined as 'no 
association.' 

2. Consistency across studies: An association requires repli- 
cation in other studies. Consistency is defined as 
>75% of associations being strong or moderate. 

3. Temporality: Refers to temporal relationship of associ- 
ation between exposure and outcome; exposure has 
to precede outcome. It is difficult to ensure temporal 
correctness because study participants are assumed to 
walk daily and are, therefore, exposed to knee joint 
loading throughout observation periods. We retained 
this criterion because temporality is necessary to 
infer causation; absence of temporal relationship 
between exposure and outcome precludes a causal 
link. Our analysis accepted studies with a temporally 
correct design defined as baseline knee joint loading 
related to disease progression over time from that 



baseline. The temporality criterion was satisfied when 
temporal correctness was accepted in >75% of the 
included studies. 
4. Biological gradient When rate of progression increases 
(or decreases) incrementally as dose of exposure 
increases; provides strong evidence of causal relation- 
ship. This criterion is satisfied when >75% of tests for 
a trend pertaining to structural progression outcomes 
are statistically significant in the expected direction 
for knee joint loading. 
The aforementioned four criteria scores were applied 
to derive one causation score for each knee joint 
loading variable. The scores were computed as the 
unweighted sum of the scores from each of the above 
criteria, for a possible range of 0-4. A score of 4 is con- 
sidered strong evidence of a cause-and-effect relation- 
ship. A score of 3 is deemed to indicate moderate 
evidence. A score of 2 or less is considered weak evi- 
dence of causation. 

A fifth criterion, experimental evidence, was used to 
examine whether the evidence from the prospective 
cohort studies was consistent with that from RCTs. 
Experimental evidence enhances the probability of caus- 
ation, and may be used to upgrade or downgrade the 
calculated causation score. 
The following four criteria were omitted: 
Coherence. Causation is more likely if what is observed 
is supported by and in agreement with the natural 
history of the disease. This criterion is usually applied 
when the outcome is assessed by surrogate outcomes. 
This criterion is omitted because it is satisfied by default 
since imaging modalities are surrogate outcomes. 

Plausibility relates to the assessment of whether the 
association is plausible or not. This criterion is omitted 
because of the highly subjective nature of this criterion. 

Specificity relates to the specific response to the expos- 
ure. This criterion is omitted because OA is a multistruc- 
ture disease; an association between knee joint loading 
and disease progression in multiple structures does not 
preclude a possible causal relationship. 

Analogy relates to the possibility that existing similar 
association can support causation (eg, Does the same 
association exist for hip or hand OA? If so, causation 
may be supported). This criterion is omitted because 
this review focuses specifically on knee OA. 

RESULTS 

Study selection and characteristics 

The search yielded 1078 potentially eligible studies 
(figure 1; references available in online supplementary 
file 3). Of those, we included five prospective cohort 
studies published between 2002 and 2013 and involving 
a total of 452 patients with knee OA 29-33 and 0 RCTs. 
The diagnostic criteria for knee OA used to determine 
participant inclusion in individual studies varied. The 
American College of Rheumatology (ACR) criteria 34 
were used in one study, 32 whereas a combination of 
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1078 Potentially relevant articles | 



735 Articles excluded on the basis of title or abstract 
181 Not related to knee joint loading during walking 
115 Not related to knee OA 
146 Not original full-length articles 
43 Not involving human subjects 
250 Involving surgery to the knee joint as intervention 



343 Original articles on knee joint load during walking and knee OA 

t 1 C 



31 Prospective cohort studies 



12 Randomized controlled trials 



26 Excluded cohort studies 

15 Primary outcome not studied 
4 Period of follow-up was < 1year 
7 Not including biomechanical 
estimates of knee joint loading 

during walking 



300 Other study designs 
199 Cross sectional studies 
101 Other study designs 



12 Excluded RCTs 

4 Primary outcome not studied 
8 Period of follow-up was < 1year 



| 5 Included in the systematic review | | 0 Included in the systematic review | 

Figure 1 Flow chart of the literature selection process (OA, osteoarthritis; RCTs, randomised controlled trials). 



clinical and radiographic signs of knee OA was used as 
criteria in three studies, 29 31 33 and one study did not 
report the diagnostic criteria used. 30 Study character- 
istics are summarised in table 1. 

Joint loading and structural progression measures 

Four cohort studies 29-32 focused on the same knee joint 
loading estimate: the peak KAM. In these studies, peak 
KAM was assessed at baseline and related to structural 
medial disease progression assessed after a median of 
15 months from that baseline (12-72 months). In one 
study, 29 the baseline KAM impulse (area under the 
curve) was also related to medial disease progression. In 
the fifth study, 33 overall knee compression loading was 
used as load exposure. 

Two studies 30 31 defined structural disease progression 
as any worsening in semiquantitative radiographic 
medial joint space grade. 36 One study 32 defined struc- 
tural disease progression as any loss of medial femoral 
cartilage volume (quantitative MRI) above a measure- 
ment error previously established. In another study, 33 
structural disease progression was assessed from semi- 
quantitative grading of tibiofemoral cartilage loss and 
bone marrow lesions. In one study, 29 structural progres- 
sion was assessed semi-quantitatively as medial tibiofe- 
moral cartilage loss and medial tibiofemoral bone 
marrow lesions, and quantitatively as medial tibial cartil- 
age volume loss. For the meta-analysis, we extracted the 
association (OR) between peak KAM and semiquantita- 
tive progression in medial tibiofemoral cartilage loss. No 
ORs were available for the association between KAM and 
cartilage volume loss because both were continuous vari- 
ables. However, the linear regression analysis was 
extracted for the assessment of the biological gradient 
criteria. 



Association between joint loading and progression 

££1 ££££ ^K*} 

ORs were not reported in three studies, - but in 
individual patient data were reported and an OR was cal- 
culated. We contacted the authors of ref. 31 asking them 
to provide an OR if possible. The authors returned a 
conference abstract including an OR published 4 years 
prior to the article. 35 The abstract recorded 64 patients 
in contrast to the 57 patients included in ref. 31 The 
authors could not account for the difference in number 
of the subjects between the full paper and conference 
abstract. One study 33 grouped participants based on 
changes in joint loading following weight loss (increased 
vs reduced loads) and found no between group differ- 
ences in structural disease progression. Owing to the use 
of an intervention (weight loss), it was not appropriate 
to extract OR for baseline loading. 

The ORs for structural disease progression ranged 
from 0.40 to 6.46 for each unit higher baseline peak 
KAM magnitudes. From the random effects model, the 
combined OR was estimated to 1.90 (95% CI 0.85 to 
4.25). The confidence in this estimate was downgraded 
due to heterogeneity (I 2 =77%). Figure 2 summarises the 
individual and pooled estimates. 

In ref 29, the OR for progression of bone marrow 
lesions was 1.31 (95% CI 0.86 to 1.98) with every unit 
increment in baseline peak KAM; ORs for progression 
of cartilage defects and bone marrow lesions were 0.42 
(95% CI 0.12 to 1.48) and 1.8 (95% CI 0.63 to 5.17), 
respectively, with every unit increment in baseline KAM 
impulse. 

Methodological quality of the studies 

One or more criteria were limited in the assessments of 
risk of selection, detection and attrition biases. 
Individual risk assessments are presented in online 



4 



Henriksen M, etal. BMJ Open 2014;4:e005368. doi:1 0.1 1 36/bmjopen-201 4-005368 



e 



Open Access 



Table 1 


Summary of the characteristics of the included studies 






Author 

(year) 

country 


Follow-up 

Knee loading Structural time 
variable progression measure (months) 


Number of patients 
(total/prog ressors/ 
non-progressors) 


Females 

(%) Reference 



Miyazaki 
etal (2002) 
Japan 
Chang et al 
(2007) USA 

Bennell et al 

(2011) 

Australia 



Woollard 
et al (2011) 
USA 

Henriksen 
etal (2013) 
Denmark 



Peak KAM (unit: 
%BWxHT) 

Peak KAM (unit: 
%BWxHT) 

Peak KAM (unit: 
%BWxHT) 



KAM impulse 
(unit: %BWxHT) 

Peak KAM (unit: 
Nm/kg)t 

Peak overall 
knee 

compression 
force (unit: N) 



>1 grade 
according to 
Altman atlas 
>1 grade 
according to 
Altman atlas 
>1 grade 
medial 
tibiofemoral 
cartilage 
defects 
Cartilage 
volume loss 
(mm 3 ) 
Cartilage 
volume loss 
(mm 3 ) 

Cartilage loss 




sq 12 
MRI 



q 

MRI 



q 

MRI 

sq 
MRI 



12 



12 



74/42/32 



56 (64*)/41/15 



138/45/93 



144/NA/NA 



13/6/7 



157/NA/NA 



Median: 1 2 Totals: 452/1 34/1 47 



78 



59 



56 



23 



89 



31 
35* 



Mean: 67 



*On request, the authors forwarded a conference abstract with additional data based on 64 subjects. The number of progressors/ 
non-progressors was not available from that abstract. 

tWe converted the data into %BWxHT after requesting for body weight and height data from the authors. 
KAM, knee adduction moment; BW, body weight; HT, height; q, quantitative; sq, semiquantitative grading; NA, non-applicable. 



supplementary appendix A. Across the studies, the com- 
parability of the groups at baseline was generally inad- 
equately or unclearly described (selection bias). Also, 
descriptions of how reliably the joint load exposures 
were ascertained were not included in four of the five 
studies (detection bias), and group differences in loss of 
subjects were not accounted for (attrition bias). The dis- 
crepancy in subject numbers in reports from one 
cohort, 31 35 induced a high risk of selection and attrition 
bias. Table 2 presents summaries of the quality assess- 
ment. Agreement between quality raters was good 
(k=0.73 (95% CI 0.48 to 0.97)). 

Causality assessment 

The non-significant combined OR indicates no associ- 
ation between knee joint load during walking and struc- 
tural disease progression. Thus, a score of 0 in the 
Strength of Association criterion was assigned. The 



associations were not consistently replicated (50%), 
resulting in a score of 0 in the Consistency criterion. This 
finding was further supported by the heterogeneity 
between the studies (I 2 =77%; figure 2), and the discrep- 
ancies between baseline peak KAM magnitudes among 
progressors and non-progressors across the cohorts 
(figure 3). Temporal correctness was accepted in one 
cohort. 30 After requesting for additional information 
from the authors of one cohort study, 31 we discovered 
that the article reported only part of the study. The add- 
itional information retrieved 35 showed that the study 
consisted of three measurements taken every 18 months. 
Joint space and Kellgren-Lawrence grades (X-rays) were 
recorded at all three time points, whereas the KAM was 
obtained at time points 2 and 3 (18 and 36 months). 
KAM at time point 2 (18 months) was associated with 
structural disease progression from 18 to 36 months and 
structural disease progression from time points 1 to 2 



Figure 2 Forest plot of the 
individual ORs of structural 
disease progression with every 
increment in baseline peak knee 
adduction moment. Weights are 
from a random effects analysis. 
Individual and pooled estimates 
are shown with 95% CIs. 



Study or Subgroup log[Odds Ratio] SE Weight 



Odds Ratio 
IV, Random, 95% CI 



Odds Ratio 
IV, Random, 95% CI 



Miyazaki (2002) 
Chang (2007) 
Woollard (2011) 
Bennell (2011) 



1.866 
1.012 
-0.198 
0.14 



0.51 
0.47 
0.43 
0.22 



22.0% 
23.2% 
24.4% 
30.4% 



6.46 [2.38, 17.56] 
2.75 [1.10, 6.91] 
0.82 [0.35, 1.91] 
1.15 [0.75, 1.77] 



Total (95% CI) 1 00.0% 1 .90 [0.85, 4.25] 

Heterogeneity: Tau 2 = 0.51; Chi 2 = 13.29, df = 3 (P = 0.004); I 2 = 77% 
Test for overall effect: Z = 1.55 (P = 0.12) 



0.01 0.1 1 
Joint loads protective 



10 100 
Joint loads harmful 
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Table 2 Summary of the quality assessment (risk of bias) 



Bias 
type 



Quality criteria 



Miyazaki 
etaP 0 



Chang 
etaP" 35 



Bennell 

etaP 



Woollard 

etaP 2 



Henriksen 

etaP 3 



Selection Were the descriptions of 
the groups and the 
distribution of prognostic 
factors sufficient? 
Were the groups 
assembled at a similar 
point in their disease 
progression? 
Were the groups 
comparable on all 
important confounding 
factors? 

Detection Was the joint load estimate 
reliably ascertained? 
Was adequate adjustment 
made for the effects of 
these confounding 
variables? 

Was outcome assessment 
blind to exposure status? 
Was follow-up long enough 
for the outcomes to occur? 
What proportion of the 
cohort was followed-up?* 
Attrition Were dropout rates and 
reasons for drop-out 
similar across groups? 
Overall risk of biast 



Methodological quality 



Very serious 
limitations; 
high risk of 
bias 
Low 



I 



A 
A 
A 
I 



Very serious 
limitations; 
high risk of 
bias 
Low 



U 
A 



A 
U 



Very serious 
limitations; 
high risk of 
bias 
Low 



A 
A 
A 



Very serious 
limitations; 
high risk of 
bias 
Low 



A 
A 
A 
A 



Very serious 
limitations; 
high risk of 
bias 
Low 



*Adequate=follow-up proportion >80%; unclear 50-80%; inadequate=<50%. 

fRisk of bias within studies is assessed using GRADE'S approach to study limitations: No serious limitation defined as all criteria being 
adequately described (high methodological quality); serious limitations defined as one criterion being inadequately described or >1 criterion 
being unclearly described (moderate methodological quality); very serious limitation defined as >1 criterion being inadequately described (low 
methodological quality). 

A, adequately described; I, inadequately described; U, unclear. 



(baseline to 18 months) was associated with increased 
KAM from 18 to 36 months. 35 Thus, temporal correct- 
ness cannot be assigned to the observed associations in 
this study, and the overall Temporality criterion was not 
satisfied (a 0 score). Biological Gradient was investigated 
in one cohort 29 that showed no positive association 
between baseline peak KAM and loss of medial tibial 
cartilage volume over 12 months, resulting in a score of 
0. Consequently, the causation score based on the 
included cohort studies was 0 for the association 
between peak KAM during walking and structural 
disease progression in patients with knee OA (table 3). 

For the KAM impulse and overall knee compression 
loading, only one study was available for each expos- 
ure, 29 33 and therefore no causation scores were calcu- 
lated for these loading exposures. 

As the systematic literature search did not identify any 
RCTs, the findings from the cohort studies can neither 



be confirmed nor refuted by experimental evidence. 
Hence, the fifth criterion could not be used to upgrade 
or downgrade the causation scores. 

DISCUSSION 

This review is the first to systematically assess — using sys- 
tematic review, meta-analysis, and the Bradford Hill con- 
siderations on causality — whether a causal link exists 
between knee joint loading during walking and struc- 
tural disease progression of knee OA. The systematic lit- 
erature search identified five prospective cohort studies 
focusing on three estimates of knee joint loading: the 
peak KAM, KAM impulse and overall tibiofemoral com- 
pression force. Using a predefined algorithm, a caus- 
ation score of 0 was reached, showing that there is no 
evidence of a causal link between knee joint loading 
during walking and structural progression of knee OA. 
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■ Miyazaki et al. 

□ Chang et al. 

□ Bennell etal. 

■ WoolLard etal. 




Progressors 



Non-progressors 



Figure 3 The mean baseline knee adduction moment (KAM) 
knee joint load exposures in patients with and without 
structural disease progression from the four individual cohorts. 
Note the overlap between progressors and non-progressors in 
the different studies. Error bars: 95% CI. 

Furthermore, no experimental data from RCTs exist to 
support or oppose the findings from the cohort studies. 
One of the prospective cohort studies 33 compared 
increased versus decreased loads, without demonstrating 
differences in structural disease progression opposing a 
causal link. 

Indirectly supporting the current findings, a longitu- 
dinal randomised trial comparing laterally wedged 
insoles ( previously demonstrated to reduce loading 37 38 ) 
with placebo insoles, showed no group differences in 
structural disease progression. 39 While that study seem- 
ingly provided experimental evidence, no joint load 
exposure data were included to document the assertion 
that joint loadings were in fact reduced in the 'wedged 
insole' group. Thus, the study did not fulfil the criteria 
for inclusion in this review. A recent meta-analysis on 
symptomatic effects of laterally wedged insoles showed 
no beneficial effects over neutral insoles, indicating 



limited effects of laterally wedged insoles. 40 However, the 
null outcome is supported by the results in ref. 33, 
included in this review, showing neither detrimental 
effects of increased loading nor beneficial effects of 
decreased loading over 12 months in a group of knee 
OA patients. 

The quality assessment of the included studies showed 
high risks of selection, detection and attrition biases 
(table 3). An important precondition for valid results in 
cohort studies is that groups should be comparable at 
baseline. Unless the groups are balanced for relevant 
baseline characteristics, differences in structural disease 
progression cannot confidently be attributed to the 
effects of interest (joint loads during walking). The 
included studies revealed significant inadequacies when 
assessing the comparability of the groups on relevant 
baseline characteristics, leading to prognostic imbalance. 
Four of the five cohort studies did not report how reli- 
ably the joint loading estimates were ascertained (detec- 
tion bias), which could influence the individual results. 
Moreover, the loss of subjects at follow-up in two of the 
five cohort studies was beyond 20%, with the dropout 
rates and reasons for dropout across groups not 
adequately accounted for (attrition bias). Finally, the dis- 
crepant numbers of subjects presented in different 
reports from the same cohort 31 35 introduced a serious 
risk of bias and low confidence in the results. The 
overall low methodological quality of the included 
studies gives little confidence in the reported estimates 
of causation. 

Despite the shortage of studies reporting a causal link 
between joint loading and structural disease progression, 
the conjectured relationship underlies the rationale for 
many clinical efforts to reduce knee loading during 
walking in attempts to retard disease progression. 
Weight loss, insoles, modified footwear, strengthening 
exercises, knee braces and gait modifications have been 



Table 3 Causation criteria and scores for the identified knee joint loading exposures 

Strength* 

Summary Consistency! Temporality^ 

Knee joint load exposure or OR (95% CI) N (%) N (%) 

Peak KAM 1 .90 (0.85 to 4.25) 2/4 (50%) 1/4 (25%) 

KAM impulse 0.42 (0.12 to 1.48)H -** 0/1 (0%) 

Peak overall compression No group difference -** 1/1 (100%) 

*Strong association is defined as a pooled OR >5 with lower 95% CI excluding 2.0. Moderate association is any statistically significant 
association. 

tConsistency is defined as >75% of associations showing strong or moderate associations. 

jln cohort studies it is difficult if not impossible to ensure temporal correctness because participants in the studies are expected to walk 
daily and are, therefore, exposed to knee joint loading throughout the observation period. In the current analysis, the temporality criterion is 
satisfied in studies that relate baseline knee joint loading exposures to disease progression over time from that baseline and demonstrate a 
statistically significant association with structural disease progression. 

§Biological gradient is defined as demonstrated when the rate of progression increases (or decreases) incrementally as dose of exposure 
increases. 

HEstimate based on one cohort study 29 assessing baseline KAM impulse to progression of cartilage defects assessed by semiquantitative 
grading of MRI. 

**Consistency not possible to assess with only one cohort. 
ttCausality score not calculated because only one study available. 
KAM, knee adduction moment. 
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suggested as means to lower knee joint loads and thus 
slow structural disease progression through mechanical 
pathways, 41-46 with effects yet to be proven. The ration- 
ale for such efforts is diluted by the current results, not 
only by the low causality score, but also by the inconsist- 
ency across studies in the loading magnitudes between 
individuals who progress and those that do not. Figure 3 
illustrates that patients with a certain baseline peak KAM 
magnitude in one study progress, whereas those with 
similar baseline peak KAM magnitudes in other studies 
do not. These differences in knee joint loading magni- 
tudes between studies may be due to differences in gait 
analysis protocols across the involved gait labs, for 
example, in the selection of reference frame convention 
for the calculation of the KAM 47 and footwear condi- 
tion. 44 However, including diversity of measures of the 
same variable can strengthen the confidence in the 
results of a systematic review. 

The results from the included studies exhibited a high 
degree of heterogeneity (I 2 from the meta-analysis) 
(figure 2). The two cohorts with significant OR 30 31 35 
both used radiography as the structural outcome assess- 
ment, whereas the studies reporting non-significant ORs 
used MRI. 29 32 Furthermore, the two studies reporting 
significant associations had longer follow-up periods (72 
and 18 months), whereas the studies without significant 
associations assessed disease progression after 
12 months. This concurrence of follow-up time and 
imaging modality precludes conclusions about whether 
the differences in results are based on follow-up time or 
imaging modality. Radiography is the current clinical 
gold standard for assessing structural disease progres- 
sion, but it has low temporal sensitivity, making long 
follow-up times necessary. MRI — with its shorter tem- 
poral sensitivity — is possibly the more promising tool for 
assessing single structures and should be used in studies 
with longer follow-up times. The total number of 
patients was 452 in five cohorts; that is a very small 
number of patients when considering the large popula- 
tion and when compared to similar reviews in other 
chronic diseases. For instance, a recent review of the 
causal relationship between dietary factors and coronary 
heart disease included several millions of individuals in 
361 cohorts and a multitude of dietary exposures. 23 

Our study has strengths because we undertook several 
measures to minimise bias, restricting our review to 
studies with the strongest causal inference (cohort 
studies and RCTs), conducted independent assessments 
of study eligibility, and using predefined criteria to evalu- 
ate causation and methodological quality. We may be cri- 
ticised for using arbitrary OR cut-offs to define strong 
and moderate association, but identifying the true 
cut-off for defining clinically meaningful OR values is 
impossible in a field that has few longitudinal studies to 
consult. An important limitation is the causality score 
that applied. The score is a modification of a previous 
similar score, 23 thus it has limited empirical evidence to 
support its validity. Also, the causality score omitted 



several of the original causation considerations as pro- 
posed by Hill, 21 yet the applied method was prespeci- 
fied. We may also be criticised for analysing any 
image-based structural deterioration, rather than separ- 
ating the analyses in X-ray and MRI-based assessments. 
However, the analysis plans were also prespecified, 
without prior knowledge of the number of studies and 
specifics of the outcomes. 

Current guidelines on interpretation of data from 
observational studies and data combined from repeated 
observational studies (with a reasonable internal validity) 
would correspond to confidence in the estimate's being 
'Low-quality evidence', but could be upgraded if there is 
great confidence in the estimates and the effects are 
large. 48 However, the current analysis revealed a large 
inconsistency among study results, poor precision of the 
combined analysis, and low methodological quality. 
Thus, we downgraded further to 'very low quality evi- 
dence'. The inconsistency of the available studies may 
limit the appropriateness of meta-analyses, yet support 
the overall conclusion of this systematic literature review. 

In conclusion, our systematic review, meta-analyses, 
and application of Bradford Hill's considerations on 
causation shows that there is no evidence of a causal link 
between estimates of knee joint loading during walking 
and structural progression of knee OA. Future large well- 
designed prospective cohorts with subsequent confirm- 
ation from randomised trials are strongly recommended. 
The current study findings question the rationale 
behind clinical efforts to reduce knee joint loading 
during walking as such efforts are currently supported 
by very low-quality evidence showing no causal link 
between knee joint loading during walking and progres- 
sion of knee OA. 
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